15 research outputs found

    Federated knowledge base debugging in DL-Lite A

    Due to the continuously growing amount of data, the federation of different and distributed data sources has gained increasing attention. A variety of approaches has been proposed to tackle the challenge of federating heterogeneous sources. Especially in the context of the Semantic Web, Description Logics are one of the preferred methods to model federated knowledge based on a well-defined syntax and semantics. However, the more data are available from heterogeneous sources, the higher the risk of inconsistency, a serious obstacle to reasoning tasks and query answering over a federated knowledge base. For a single knowledge base, the process of knowledge base debugging, comprising the identification and resolution of conflicting statements, has been widely studied, while federated settings integrating a network of loosely coupled data sources (such as LOD sources) have mostly been neglected. In this thesis we tackle the challenging problem of debugging federated knowledge bases, focusing on a lightweight Description Logic language, DL-LiteA, that is aimed at applications requiring efficient and scalable reasoning. After introducing formal foundations such as Description Logics and Semantic Web technologies, we clarify the motivating context of this work and discuss the general problem of information integration based on Description Logics. The main part of this thesis is subdivided into three subjects. First, we discuss the specific characteristics of federated knowledge bases and provide an approach for detecting and explaining contradictory statements in a federated DL-LiteA knowledge base. Second, we study the representation of the identified conflicts and their relationships as a conflict graph and propose an approach for repair generation based on majority voting and statistical evidence.
Third, to provide an alternative way of handling inconsistency in federated DL-LiteA knowledge bases, we propose an automated approach for assessing adequate trust values (i.e., probabilities) at different levels of granularity by leveraging probabilistic inference over a graphical model. In the last part of this thesis, we evaluate the previously developed algorithms against a set of large distributed LOD sources. The discussion of the experimental results shows that the proposed approaches are effective, efficient and scalable with respect to real-world scenarios. Moreover, because our algorithms exploit the federated structure, the number of identified wrong statements, the quality of the generated repair, and the granularity of the assessed trust values all benefit from an increasing number of integrated sources.
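The repair strategy sketched in the abstract (majority voting over a conflict graph) can be illustrated roughly as follows. This is a hypothetical sketch, not the thesis's actual algorithm: statements are nodes, each conflict is an edge between two contradicting statements, each statement carries the number of sources asserting it, and we repeatedly retract the statement involved in the most conflicts, breaking ties against minority-supported statements.

```python
def repair(statements, conflicts, support):
    """statements: set of statement ids; conflicts: set of frozenset
    pairs of contradicting ids; support: id -> number of sources
    asserting the statement (the 'majority vote')."""
    active = set(statements)
    removed = set()

    def live_conflicts():
        # conflicts whose both statements are still in the knowledge base
        return [c for c in conflicts if c <= active]

    while live_conflicts():
        # count conflict participation per active statement
        degree = {s: 0 for s in active}
        for a, b in (tuple(c) for c in live_conflicts()):
            degree[a] += 1
            degree[b] += 1
        # retract the most conflicting, least supported statement
        victim = max(active, key=lambda s: (degree[s], -support[s]))
        active.discard(victim)
        removed.add(victim)
    return removed  # the generated repair: statements to retract

# Example: s1 contradicts both s2 and s3 and has minority support.
stmts = {"s1", "s2", "s3"}
confl = {frozenset({"s1", "s2"}), frozenset({"s1", "s3"})}
supp = {"s1": 1, "s2": 3, "s3": 2}
print(repair(stmts, confl, supp))  # -> {'s1'}
```

The tie-breaking key `(degree[s], -support[s])` encodes the voting intuition: among equally conflicting statements, the one asserted by fewer sources is retracted first.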

    RDF validation requirements - evaluation and logical underpinning

    There are many case studies for which the formulation of RDF constraints and the validation of RDF data against these constraints is very important. As part of the collaboration with the W3C and the DCMI working groups on RDF validation, we identified major RDF validation requirements and initiated an RDF validation requirements database, which is open for contributions at http://purl.org/net/rdf-validation. The purpose of this database is to collaboratively collect case studies, use cases, requirements, and solutions regarding RDF validation. Although there are multiple constraint languages that can be used to formulate RDF constraints (associated with these requirements), there is no standard way to formulate them. This paper evaluates to what extent each requirement is satisfied by each of these constraint languages. We take reasoning into account as an important pre-validation step and therefore map constraints to DL in order to show that each constraint can be mapped to an ontology describing RDF constraints generically.
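What "validating RDF data against a constraint" amounts to can be shown with a minimal, invented example: triples as `(subject, predicate, object)` tuples and a cardinality constraint "every foaf:Person has exactly one foaf:name". The vocabulary and the constraint are illustrative only; real constraint languages such as SHACL or ShEx express this declaratively.

```python
TYPE, PERSON, NAME = "rdf:type", "foaf:Person", "foaf:name"

def validate_exactly_one_name(triples):
    """Return (subject, name_count) pairs violating the cardinality
    constraint 'every foaf:Person has exactly one foaf:name'."""
    persons = {s for s, p, o in triples if p == TYPE and o == PERSON}
    violations = []
    for person in sorted(persons):
        names = [o for s, p, o in triples if s == person and p == NAME]
        if len(names) != 1:
            violations.append((person, len(names)))
    return violations

data = [
    ("ex:alice", TYPE, PERSON),
    ("ex:alice", NAME, "Alice"),
    ("ex:bob", TYPE, PERSON),   # bob has no name -> violation
]
print(validate_exactly_one_name(data))  # -> [('ex:bob', 0)]
```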

    Specificatio, Derer/ von denen allhier sitzenden Inquisiten/ Andreas Hempeln und Augustin Nollen/ angegebenen Diebes-Wirthe ...

    No full text
    Caption title. Year of publication taken from the dating of the letters contained.

    The role of reasoning for RDF validation

    No full text
    For data practitioners embracing the world of RDF and Linked Data, its openness and flexibility are a mixed blessing. For them, data validation according to predefined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in the DCMI RDF Application Profiles Task Group and in cooperation with the W3C Data Shapes Working Group, we have to date published 81 types of constraints that are required by various stakeholders for data applications. These constraint types form the basis for investigating the role that reasoning and different semantics play in practical data validation, why reasoning is beneficial for RDF validation, and how to overcome the major shortcomings when validating RDF data by performing reasoning prior to validation. For each constraint type, we examine (1) whether reasoning may improve data quality, (2) how efficient, in terms of runtime, validation is with and without reasoning, and (3) whether validation results depend on the underlying semantics, which differ between reasoning and validation. Using these findings, we determine which constraint types the most common constraint languages can express and give directions for the further development of constraint languages.
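The benefit of reasoning as a pre-validation step can be sketched with a toy example (invented vocabulary, not from the paper): a constraint targets `foaf:Person`, but the data only types `ex:bob` as `ex:Student`. Without RDFS subclass inference the constraint never applies to bob; after materializing `rdfs:subClassOf` entailments it does.

```python
TYPE, SUBCLASS = "rdf:type", "rdfs:subClassOf"

def infer_types(triples):
    """Materialize rdf:type triples via rdfs:subClassOf to a fixpoint;
    a toy stand-in for a real RDFS reasoner."""
    triples = set(triples)
    changed = True
    while changed:
        changed = False
        sub = {(c, d) for c, p, d in triples if p == SUBCLASS}
        for s, p, o in list(triples):
            if p == TYPE:
                for c, d in sub:
                    if o == c and (s, TYPE, d) not in triples:
                        triples.add((s, TYPE, d))
                        changed = True
    return triples

def persons(triples):
    # the set of resources a Person-targeted constraint would check
    return {s for s, p, o in triples if p == TYPE and o == "foaf:Person"}

data = {
    ("ex:Student", SUBCLASS, "foaf:Person"),
    ("ex:bob", TYPE, "ex:Student"),
}
print(persons(data))               # -> set(): constraint never fires
print(persons(infer_types(data)))  # -> {'ex:bob'}: now validated
```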

    A Parallel Watershed Algorithm

    No full text
    The watershed transformation is a popular image segmentation algorithm for grey-scale images. Sequential watershed algorithms perform a highly data-dependent flooding process over the global image. Because of global data dependencies across the sub-domains, parallel algorithms that distribute the image over the available processors and simulate the flooding process achieve only a limited speedup, and the achievable speedup is highly data-dependent. In this paper we show that it is possible to achieve a data-independent speedup for images without plateaus. This can be done by inserting temporary labels where the solution depends on neighbouring results. We show that the local solutions can be merged into a global solution in a data-independent way. Submitted to: 10th Scandinavian Conference on Image Analysis (SCIA97), Lappeenranta, Finland, June 2-11, 1997.
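The merging step described above can be illustrated with a hypothetical sketch (not the paper's actual algorithm): each processor labels its sub-image independently, using temporary labels for basins that may continue into a neighbouring tile, and a union-find pass over the shared tile borders then merges temporary labels into global basin identities independently of the image data. The per-tile flooding itself is elided here.

```python
class UnionFind:
    """Minimal union-find with path halving, used to merge labels."""
    def __init__(self):
        self.parent = {}

    def find(self, x):
        self.parent.setdefault(x, x)
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]
            x = self.parent[x]
        return x

    def union(self, a, b):
        self.parent[self.find(a)] = self.find(b)

# Temporary labels on the two sides of one tile border, pixel by pixel:
left_border  = ["A1", "A1", "A2"]   # labels produced by tile A
right_border = ["B1", "B1", "B2"]   # labels produced by tile B

uf = UnionFind()
for a, b in zip(left_border, right_border):
    uf.union(a, b)  # the same basin crosses the border at this pixel

# A1/B1 now name one global basin and A2/B2 another; relabelling each
# tile through uf.find() yields the global segmentation.
```

Because the merge only inspects border labels, its cost depends on the border length, not on the flooding order inside the tiles, which is the data-independence the abstract refers to.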